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.!_H Abstract. This paper introduces a simple formalism for dealing with deterministic, non- 

j^ deterministic and stochastic cellular automata in an unified and composable manner This formalism 

H allows for local probabilistic correlations, a feature which is not present in usual definitions. We 

show that this feature allows for strictly more behaviors (for instance, number conserving stochastic 
cellular automata require these local probabilistic correlations). We also show that several problems 
which are deceptively simple in the usual definitions, become undecidable when we allow for local 
probabilistic correlations, even in dimension one. Armed with this formalism, we extend the notion 
of intrinsic simulation between deterministic cellular automata, to the non-deterministic and stochas- 
tic settings. Although the intrinsic simulation relation is shown to become undecidable in dimension 
two and higher, we provide explicit tools to prove or disprove the existence of such a simulation 
between any two given stochastic cellular automata. Those tools rely upon a characterization of 
equality of stochastic global maps, shown to be equivalent to the existence of a stochastic coupling 
between the random sources. We apply them to prove that there is no universal stochastic cellular 
automaton. Yet we provide stochastic cellular automata achieving optimal partial universality, as 
well as a universal non-deterministic cellular automaton. 
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1. Introduction 

A motivation: stochastic simulation. Cellular Automata (CA) are a key tool in simulating natural 
phenomena. This is because they constitute a privileged mathematical framework in which to cast the 
simulated phenomena, and they describe a massively parallel architecture in which to implement the 
simulator. Often however, the system that needs to be simulated is a noisy system. More embarrassingly 
even, it may happen that the system that is used as a simulator is again a noisy system. The latter is 
uncommon if one thinks of a classical computer as the simulator, but quite common for instance if one 
thinks of using a small scale model of a system as a simulator for that system. 

Fortunately, when both the simulated system and the simulating system are noisy, it may happen that 
both effects cancel out, i.e. that the noise of the simulator is made to coincide with that of the simulated. 
In such a situation a model of noise is used to simulate another, and the simulation may even turn out to 
be. . . exact. This paper begins to give a formal answer to the question: When can it be said that a noisy 
system is able to exactly simulate another? 

This precise question has become crucial in the field of quantum simulation. Indeed, there are many 
quantum phenomena which we need to simulate, and these in general are quite noisy. Moreover, only 
quantum computers are able to simulate them efficiently, but in the current state of experimental physics 
these are also quite noisy. Could it be that noisy quantum computers may serve to simulate a noisy 
quantum systems? The same remark applies to Natural Computing in general. Still, the question is 
challenging enough in the classical setting. 

A challenge: the need for local probabilistic correlations. The first problem that one comes across 
is that stochastic CA have only received little attention from the theoretical community. When they have 
been considered, they were usually defined as the application of a probabilistic function uniformly across 
space ISl [mill [21^191. In this paper we will refer to this model as local Correlation-Free CA (CFCA). 
Indeed, this particular class of stochastic CA has the unique property that, starting from a determined 
configuration, the cell's distributions remain uncorrected after one step. This was pointed out in [1^, 
which provides an example (cf. Parity stochastic CA which we will use later) which cannot be realized 
as CFCA, in spite of the fact that they require only local probabilistic correlations and hence fit naturally 
in the CA framework. Moreover, ||T1 points out that the composition of two CFCA is not always a CFCA. 
The lack of composablity of a model is an obstacle for defining intrinsic simulation, because the notion 
must be defined up to grouping in space and in time. In [ 1] a composable model is suggested, but it lacks 
formalization. 

In this paper we propose a simple formaUsm to deal with general stochastic CA. The formalism relies on 
considering a CA F{c, s) fed, besides the current configuration c, with a new fresh independent uniform 
random configuration s at every time step. This allows any kind of local probabilistic correlations and 
includes in particular all the examples of HI. As it turns out, the definition also captures deterministic 
and non-deterministic CA (non-deterministic CA are obtained by ignoring the probability distribution 
over the random configuration). 

Results on stochastic simulation. This formalism allows us to extend the notions of simulation de- 
veloped for the deterministic setting ISlIll, to the non-deterministic and stochastic settings. The choice 
of making explicit the random source in the formalism has turned out to be crucial to tackle the second 
problem, as it allows a precise analysis of the influence of randomness, in terms of simulation power. 



The second problem that one comes across is that the question of whether two such stochastic CA are 
equal in terms of probability distributions is highly non-trivial. In particular, we show that testing if 
two stochastic CA define the same random map becomes undecidable in dimension 2 and higher (The- 
orem 4.1 1. Still, we provide an explicit tool (the coupling of the random sources of two stochastic CA) 
that allows to prove (or disprove) the equality of their probability distributions. More precisely, we show 
that the existence of such a coupling is strictly equivalent to the equality of the distribution of the random 



maps of two stochastic CA (Theorem 4.5 1. 



The choice of making explicit the random source allows us to show some no-go results. Any stochastic 
CA may only simulate stochastic CA with a compatible random source (where compatibility is expressed 



as a simple arithmetic equation, Theorem 8.2 1. It follows that there is no universal stochastic CA (Corol- 



lary [83]l. Still, we show that there is a universal CA for the non-deterministic dynamics (Theorem |8.4[ ), 
and we are able to provide a universal stochastic CA for every class of compatible random source (The- 
orem 



8.5 1 



Results on questions of computability versus allowing for local probabilistic correlations. The fact 
that testing if two stochastic CA define the same random map is undecidable in dimension 2 and higher, 
which is not the case in the particular case of CFCA, suggested that many problems appear deceptively 
simple in the CFCA formalism. And indeed, their difficulty comes back as soon as one iterates the 
CFCA: for instance, we show that in dimension 2 and higher, it is undecidable whether the squares of 
two CFCA define the same random map (Corollary |5.8[ ). Worse even, it is undecidable whether the 
square of a CFCA is noisy (Theorem |5.7| ). 
In dimension one, these problems, and many others, are shown to be decidable for general stochastic 



CA (Corollary 6.2 and|6.8|). Thus, they cannot serve to point out a separation with CFCA. Yet, we show 



that the Pattern-Probability-Threshold problem (i.e. the question whether some pattern can appear with 



a probability higher that some threshold) is again undecidable for stochastic CA (Theorem 4.9 1, whereas 



it is decidable for CFCA (Theorem |6.5[). We also show that some behaviors are out-of -reach of CFCA. 



Indeed, CFCA cannot be number-conserving unless they are deterministic (Lemma 5.3 1. Moreover, even 



the iterates of a CFCA cannot be surjective number-conserving unless they are deterministic (Theorem 



5.4 1. Iterates of two-state CFCA cannot reproduce behaviors alike the Parity example either (Theorem 



5.5 1. 



Plan. Section |2] recalls the vital minimum about probability theory. Section [3] states our formalism. 
Section[4]explains that some global properties of the SCA (such as equality of stochastic global functions, 
the probability of appearance of certain patterns, injectivity and surjectivity) cannot be decided from the 
local rules, but may be approached through some other proof techniques. These problems tend to simplify 
for the particular case of local Correlation-Free SCA, which corresponds to the more usual definition, 
and for the one-dimensional case, as explained in Sections |5] and [6] This shows that local Correlation- 
Free SCA are fundamentally simpler: some behaviors cannot be reached. Section [TJextends the notion of 
intrinsic simulation to the non-deterministic and stochastic settings. Section [8] provides the no-go results 



in the stochastic setting, the universality constructions. Section 10 concludes this article with a list of 
open questions. 



2. Standard Definitions 

Even if this article focuses mainly on one-dimensional CAfor the sake of simplicity, it extends naturally 
to higher dimensions. Each time a result is sensitive to dimension, it will be explicited in the statement. 

For any finite set A we consider the symbolic space A^. For any c G A^ and 2: G Z we denote by 
Cz the value of c at point z. A^ is endowed with the Cantor topology (infinite product of the discrete 
topology on each copy of A) which is compact and metric (see 1 19] for details). A basis of this topology 
is given by cylinders which are actually clopen sets: given some finite word u and some position z, the 
cylinder [u]z is the set [u]z = {c G A^ : Vx, ^ x < |n| — 1, Cz+x = Ux}- 

We denote by ^A{A^) the set of Borel probability measures on A^. By Caratheodory extension 
theorem, Borel probability measures are characterized by their value on cylinders. Concretely, a measure 
is given by a function fj, from cylinders to the real interval [0, 1] such that fJ.{A^) = 1 and 

Vn G A*,yz G Z, /^([m]^) = '^iJ,{[ua]z) = '^ iJ.{[au]z-i) 

We denote by va the uniform measure over A^ (s.t. z^a(Mz) = <a]\u\ )■ We shall denote it as z/ when 
the underlying alphabet A is clear from the context. 

We endow the set A4{A^) with the compact topology given by the following distance: 
5^(/Wi, Ai2) = Z^n^o^ " ■ ™9'XueA2"+i|A*i(M-n) — /^2(M-n)|- See ||27]| for a review of works on cel- 
lular automata from the measure-theoretic point of view. 

3. Stochastic Cellular Automata 

Non-deterministic and stochastic cellular automata are captured by the same syntactical object given 
in the following definition. They differ only by the way we look at the associated global behavior. 
Moreover, deterministic CA are a particular case of stochastic CA and can also be defined in the same 
formalism. 

3.1. The Syntactical Object 

Definition 3.1. A stochastic cellular automaton A = {Q, R, V, V, /) consists in: 

• a finite set of states Q 

• a finite set R called the random symbols 

• two finite subsets of Z: V = {vi, . . . ,Vp} andF' = {v'^, . . . ,v',}, called the neighborhoods; pand 
p' are the sizes of the neighborhoods and k = uiaK-y^vuV 1^1 is the radius of the neighborhoods. 

• a local transition function f : QP x R^ ^ Q 

A function c G Q^ is called a configuration; Cj is called the state of the cell j in configuration c. A 
function s G R^ is called an R-configuration. 

In the particular case where V = {0} (i.e., where each cell uses its own random symbol only), we 
say that ^ is a Correlation-Free Cellular Automaton (CFCA for short). 



Definition 3.2. (Explicit Global Function) 

To this local description, we associate the explicit global function F : Q^ x i?^ — )■ Q'^ defined for any 
configuration c and i?-configuration s by: F(c, s)^ = f({cz+vi, ■ ■ ■ ,Cz+vp), {sz+v' > • • • i ^z+v' ))• Given 
a sequence (s*) of /^-configurations and an initial configuration c, we define the associated space-time 
diagram as the bi-infinite matrix {c\)t>n pz ^^ere c* G Q^ is defined by c" = c and c*^^ = -F(c*, s*). 

We also define for any t ^ 1 the t"' iterate of the exphcit global function F^ : Q^ x (i?^) — > Q^ by 
F'^(c) = c for all configuration c and 

F'^Hc,s\...,s'+') = F{F\c,s\...,s'),s'+') 

so that c* = F\c,s^,.. .,s*). 

In this paper, we adopt the convention that local functions are denoted by a lowercase letter (typ- 
ically /) and explicit global functions by the corresponding capital letter (typically F). Moreover, we 
will often define CA through their explicit global function since details about neighborhoods often do 
not matter in this paper. 

The explicit global function captures all possible actions of the automaton on configurations. This 
function allows to derive three kinds of dynamics: deterministic, non-deterministic and stochastic. 

3.2. Deterministic and Non-Deterministic Dynamics 

Deterministic. The deterministic global function Vp : Q^ — )■ Q^ of ^ = (Q, R, V, V, /) is defined 
by T>p{c) = F(c, 0^) where is a distinguished element of R. A is said to be deterministic if its local 
transition function / does not depend on its second argument (the random symbols). 

Non-Deterministic. The non-deterministic global function Np '■ Q^ — > V{Q^) of A is defined for any 

configuration c G Q^ by Mp{c) = {F{c, s) : s e R^}. 



Dynamics. The deterministic dynamics of A is given by the sequence of iterates {Vp)t^Q. Similarly 
the non-deterministic dynamics of A is given by the iterates Mp : Q^ — )■ V{Q'^) defined by Mp{c) 

Jc'eA/'*(c)- 



{c} andAA|+i(c) = Uc'eAT* (c) AAf(c'). 



3.3. Stochastic Dynamics 

The stochastic point of view consists in taking the /^-component as a source of randomness. More 
precisely, the explicit global function F is fed at each time step with a random uniform and independent 
i?-configuration. This defines a stochastic process for which we are then interested in the distribution of 
states across space and time. By Caratheodory extension theorem, this distribution is fully determined 
by the probabilities of the events of the form "starting from c, the word u occurs at position z after t steps 
of the process". Formally, for t = 1, this event is the set: 

£cM^ = {s£R^:F{c,s)Ci[u]z]. 

In order to evaluate the probability of this event, we use the locality of the explicit global function 
F. The event "F(c, s) G [u]z" only depends of the cells of s from position a = z — k to position 



b = z + k + \u\ — I. Therefore, if J = {v € R^^"- : F{c, [v]a) ^ [u]z}, then £c,[u]z = ^v<^,j[v]a and 
hence £c,[u]^ is a measurable set of probabiUty: iyR{£c,[u]J = J2veJ ^^(Ha) = l-^l/l-^l^ " (recall that 
uji is the uniform measure over R^). 

More generally to any CA A we associate its stochastic global function Sp '■ Q^ — ^ ^A{Q^) defined 
for any configuration c G Q* by: Vu G Q^, Vz G Z, 

{Sf{c)){[u]z) = iyR{£c,[u]J = the probability of event £c,[u],- 

Example. For instance, consider the stochastic function Parity that maps every configuration over the 
alphabet {0, 1, #} to a random configuration in which every {0, l}-word delimited by two consecutive 
# is replaced by a random independent uniform word of length with even parity. This cannot be realized 
by a CFCA. Still, one can realize the stochastic function Parity as a stochastic CA with Q = {#, 0, 1}, 
R = {0, 1} and local rule /: g{-^'0'^> X ii^-i'O} ^ g given by: for all c_i,co, ci, s_i, sq G {0,1} 
and a, 6 G {#,0,1}, 

/(a#6,s_iSo) = # /(#co#,s-iSo) = /(#coCi,s_iSo) = so 

/(c_iCo#, S_iSo) = S-l /(c_iCoCi, S-iSo) = S-i + So 

One can easily check that this local probabilistic correlations ensures that every word delimited by two 
consecutive # is indeed mapped to a uniform independent random word of even parity. 

Dynamics. As opposed to the deterministic and non-deterministic setting, defining an iterate of this 
map is a not so trivial task. There are two approaches: defining directly the measure after t steps or 
extending the map Sp to a map from A4{Q^) to itself. Both rely crucially on the continuity of F. In 
particular, we want to make sure that the definition of the measure after t steps matches t iterations of 
the one-step map, and hence, is independent of the explicit mechanics of F but depends only on the map 
Sp defined by F. 

The easiest one to present is the first approach. For any t ^ 1, the event <S* r i that the word u appears 
at position z at time t from configuration c consists in the set of all t-uples of random configurations 
(s^, . . . , s*) yielding u at position z from c, i.e.: 

4m. = {(^''•••'^*) e {Ry ■■ F\c,s\...,s') G N4 

As before <S* r , is a measurable set in (i?^) because it is a product of finite unions of cylinders by the 
locality of F. We therefore define Sp : Q^ — )■ M-{Q^), the iterate of the stochastic global function, by: 

{S^p{c)){[u],) = VRt (£^*j„] J = the probabiUty of event £:* [„]^ 

where vjit denotes the uniform measure on the product space (i?^) . For similar reasons as above, 5^(c) 
is a well-defined probability measure. 

For all t ^ and all words u G Q" with n ^ 2kt + 1, we will also denote by F*{u) the random 
variable for the random image v G Q"^^'^* of u by F*, defined formally as: for all u G Q" and v G 

f)n—2kt 

Vt{F\u) = v] = {Sy{c)){[v]kt), for any c G Mq. 



The following key technical fact ensures that two automata define the same distribution over time as 
soon as their one-step distributions match. 

Fact 3.3. Let A and B be two stochastic CA with the same set of states Q (and possibility different 
random alphabet) and of explicit global functions F and G respectively. If Sp = Sg then for all t ^ 1 
we have Sp = Sq 

Proof: 

The proof is written for ID CA to simplify notations but it extends to any dimension in a straighforward 
way. Consider a CA of explicit global function F. Consider a word u and a position z. Let : Q^ — )■ 
V{R^) be function that associates to a configuration c the event £c,[u]z- 'Pic) is entirely determined by 
the states of the cells from positions a = z — k to b = z + \u\ + k in c (locality of F). Therefore is 
constant over every cylinder [v]a with v G Q^~". If we distinguish some c^ G [v]a for every v G Q'^^"', 
we obtain by definition of F* and continuity of F: 



^tih - U (^ilv]a ^ ^ 






Then, since sets (<5* r , ) pn^-a ^^ pairwise disjoint (because F is deterministic and cylinders \v\a are 
pairwise disjoint), we have 

= E (5kc))(Ha)-(5^(c.))(M.) 
v£Q>>-<^ 

The value of Sp{c) over cylinders can thus be expressed recursively as a function of a finite number of 
values Sp over a finite number of cylinders. It follows that if for some pair of CA A and B with explicit 
global functions F and G we have Sp = Sq, then Sp = Sq for all t. D 

In our setting one can recover the non-deterministic dynamics from the stochastic dynamics of a 
given stochastic CA. This heavily relies on the continuity of explicit global functions and compacity of 
symbolic spaces. 

Fact 3.4. Given two CA with same set of states and explicit global functions Fa and Fb, if Sp^^ = Spg 
then7V>^ = Mpg. 

Proof: 

Given some stochastic CA of explicit global function F, some configuration c and some cylinder [u]z we 
have 

Mp{c) n [u], / 4^ £:,j„]^ / 4^ {Sp{c)){[u],) > 

by definition of Mp and Sp. Since Mp{c) is a closed set (continuity of F) it is determined by the set 
of cylinders intersecting it (compacity of the space). Hence Mp{c) is determined by Sp{c). The lemma 
follows. D 



4. From Local to Global 

It is well-known in deterministic CA that determining global properties from the local representation is 
a generally hard problem. The purpose of this section is to show that the situation is even worse for 
stochastic CA. 

4.1. Equality of random maps: undecidability and explicit tools 

An undecidable task for dimension 2 and higher. In the classical deterministic case, it is easy to de- 
termine whether two CA have the same global function. Equivalently determining whether two stochastic 
CA, as syntactical objects, have the same explicit global functions F and G is easy. However, given two 
stochastic CA which have possibly different explicit global function F and G, it still may happen that 
Mf = A/g or Sp = Sg, and determining whether this is the case turns out to be a difficult problem. 



In fact. Theorem 4.1 states that these two decision problems are at least as difficult as the surjectivity 



problem of classical CA. 

Theorem 4.1. Let Vn (resp. Vs) be the problem of deciding whether two given stochastic CA have 
the same non-deterministic (resp. stochastic) global function. The surjectivity problem of classical 
deterministic CA is reducible to both Vn and Vs- 

Proof: 

The proof is written for ID CA to simplify notations but it extends to any dimension in a straighforward 
way. Consider a classical CA F : Q^ — )• Q^ and define fip as the image by F of the uniform measure 

po on Q^: 

It is well-known that F is surjective if and only if fip = u (this result is true in any dimension; the proof 
for dimension 1 is in [16] and follows from [i22il for higher dimensions, but we recommend i27il for a 
modem exposition in any dimension). 

Now let us define the stochastic CA ^ = {Q,Q,V,V',g) such that G{c,s) = F{s). With this 
definition, A is such that, for all c, Sg{c) = jip- Hence, Sg{c) is the uniform measure for any c if and 
only if F is surjective. We have also Mg{c) = Q^ for all c if and only if G is surjective. The theorem 
follows since A is recursively defined from F. D 



Surjectivity of classical CA is an undecidable property in dimension 2 and higher PTS]. As an imme- 
diate corollary, we get undecidability of equality of global maps in dimension 2 and higher. 

Corollary 4.2. Fix d ^ 2. Problems V^ and Vs are undecidable for CA of dimension d. 

Explicit tools for (dis)proving equality. Even if testing the equality of the non-deterministic or 
stochastic dynamics of two stochastic CA is undecidable for dimension 2 and higher. Theorem 4.5 states 
that equality, when it holds, can always be certified in terms of a stochastic coupling. Indeed the stochas- 
tic coupling, by matching their two source of randomness, serves as a witness of the equality of the 
stochastic CA. This provides us with a very useful technique, because the existence of such a coupling 
is easy to prove or disprove in many concrete examples. Again the result heavily relies on the continuity 
of the explicit global function F. 



Let us first recall the standard notion of coupling. 

Definition 4.3. Let /^i G M-{Qf) and fi2 S 7W((5f)- A coupling of /xi and fi2 is a mea- 
sure 7 G M{Qf X Qf) such that for any measurable sets £i and £2, l{£i xQf) = /Ui('?i) and 

7(Q?x£:2) = /U2(£:2). 

Concretely, a coupling couples two measures so that each is recovered when the other is ignored. The 
motivation in defining a coupling is to bind the two distributions in order to prove that they induce the 
same kind of behavior: for instance, one can easily couple the two uniform measures over {1,2} and 
{1, 2, 3, 4} so that with probability 1, both numbers will have the same parity (7 gives a probability 1/4 
to each pair (1, 1), (2, 2), (1, 3) and (2, 4) and to the others). This demonstrates that the parity function 
is identically distributed in both cases. 



Theorem 4.5 states that the dynamics of two stochastic CA are identical if and only if there is a 
coupling of their random configurations so that their stochastic global functions become almost surely 
identical. This is one of our main results. 

Definition 4.4. Two stochastic cellular automata, ^1 = {Q,Ri,Vi,V{, fi) and A2 = 
(Q, -R2 5 ^2, V2, f2), with the same set of states Q are coupled on configuration c S Q^ by a measure 

-f e M{Rf X R^) if 

1. 7 is a coupling of the uniform measures on Rf and i?f ; 

2- l{{{si,S2) G i?f X i?2 • -^i(c, Si) = F2{c, S2)}) = 1, i.e. Fi and F2 produce almost surely the 
same image when fed with the 7-coupled random sources. 

Note that the set of pairs (si, S2) defined above is measurable because it is closed (Fi and F2 are contin- 
uous). 

Theorem 4.5. Two stochastic CA with the same set of states have the same stochastic global function if 
and only if, on each configuration c, they are coupled by some measure 7c (which depends on c). 

Outline of the proof. We fix a configuration c. By continuity of the explicit global functions, we construct 
a sequence of partial couplings (7") matching the random configurations of finite support of radius n. 
We then extract the coupling 7c from (7^) by compacity of M{R\ x R^). 

Proof: 

The proof is written for ID CA to simplify notations but it extends to any dimension in a straighforward 
way. First, if Ai = {Q, Ri,Vi, V{, /i) and A2 = {Q, R2, V2, V2, /2) are coupled by 7c on configuration 
c G Q'^, consider for any cylinder [u]z the sets 

£i = {s£Rf: Flics) £ [n],} 
£2 = {seR^:F2{c,s)e [u],} 
X = {(si, S2) G i?f X i?| : Fi(c, si) = ^2(0, S2)} 

Then, by the property of the coupling by 7c, we have 

{SFAc)){[u]z) = yi{£i)=ic{£ixRl) 



where ui is the uniform measure on Rf. But 7c(<?i xi?|) = 7c ((^i xi?|) n X) since 7c(-'^) = 1- 
Symmetrically we have 

{Sp,{c)){[u],)=-fc{{Rfx£2)nX). 

But, by definition of sets £i, £2 and X, we have Rf x <S2 nX = <Si xi?| n X. We conclude that 5fi = 
Sf2 ■ For the other direction of the theorem, suppose Spi = Sf2 and fix some configuration c. We denote 
by ^ the measure Sfi{c) = Sf2{c)- Without loss of generality we can suppose that Ai and A2 have 
same radii r: ri = r2 = r. We construct a sequence (7") of measures from which we can extract a limit 
point (by compacity of the space of measures) which is a valid coupling of Ai and A2 on configuration 
c. To simplify the proof we focus on centered cylinders: for any word w of odd length, we denote by 



\w\ = \w\ 



where Zw = —-^ — • Let's fix n. For any word u G Q^"^^ we define: 



Zu, """-i"- ^tu — 2 



5i = {sGi?f :Fi(c,.)gM} 
Sl = {sCiRl:F2{c,s)Ci[u]} 

Fi and F2 being of radius k we can write S"^ as a finite union of centered cylinders of length 

2{n + k) + l: 

S'u=[j [v] 

where P^ '^ R^ . Define the following partition I^ of the real interval [0, 1) by: 

rank(i;) rank(f) + 1 



I» 



*Pl ' *P'u 



where rank(t') G {0, . . . , ij^P^ — 1} is the rank of v in some arbitrarily chosen total ordering of P^ (the 
lexicographical order for instance). Since the sets P^ form a partition of R^ when u ranges over 

all words of Q^"+^, we have for any v €^ R- : 

\P (v)\ = J- = "^iMl 
(recall that Vi stands for the uniform measure over i?f). Now, for every v G R^ and 

2 ^ n2(n+fc) + l ^ ^ n 

V G /I2 , we construct 7 as: 

nn v^ ^■M-._ }s-uy- ,- -uy- n ^q(M) if 3n s.t. t;^ G P^ for both i = 1, 2 

otherwise. 




1 ^ r>2m+l 



Furthermore, if Oj is a distinguished element of Ri, we extend the definition of 7" to any pair t; G i?^ 

'-2 



and -u^ G Pg™^^ with m ^ n + A; by: 



I else. 



By (T-additivity 7" is thus defined on any cylinder and by extension theorem is a well-defined measure. 
Now by construction, we have for any v^ G ^2(n+fc)+i. 

i''{[v'],R^) = \II{v')\-hq{[u]) 

for some u such that i;^ G P^. Hence, 7"([t;^], i?|) = i^i([t'^]). By a-additivity of 7"^ and ^, this equality 
holds for any v^ G j^2m+i ^^^^ m ^ n + k. Symmetrically we have 7"(i?f , [v"^]) = A*(72(b^]) for ^^ly 
^1 g ^2m+i Moreover, by definition, 7"([f ^], [v"^]) = if there is no u such that f* G P^ for i = 1, 2. 
We deduce that the set: 

Xn = yj D„ X 6^ 

has measure 1. More precisely, since Xn+i Q Xn, for any m ^ n, 7"(Xm) = 1- 

To conclude the proof, let 7 be any limit point of the sequence (7"^)™. By the definition of the distance 

on the space of measures, we have: 

1. ym,\/w G R1"^~^^,j{[w], i?!) = ^ili""^]) ^rid symmetrically for the R2 component, hence 7 is a 
coupling of uniform measure on Rf and i?|; 

2. Vn, 'y{Xn) = 1 hence 7(n„X„) = 1 where 

f|X„ = X = {(si,S2) G i?? X iif : Fi(c,si) = ^2(0,52)} 
n 

We deduce that ^1 and A2 are coupled on c by measure 7. D 

Notice that the proof of this theorem is non-constructive (recall that equality of stochastic global 
maps is undecidable in dimension 2 and higher). Moreover, it is easy to get convinced on a simple 
example that the coupling must depend on the configuration. Consider the two following automata with 
states Q = R = {0,1} and neighborhoods V = V = {0}: A with explicit global function F{c, s) = s 
and B with explicit global function G{c' , s') = c' + s' mod 2. Clearly, both A and B define the same 
blank noise CA and the coupling proving this fact is defined for all z ^ Z and all a,b ^ {0, 1} by 
7c([a]2, [b]z) = 1/2 if and only if a = 6 + c^ mod 2, and = otherwise. This coupling demonstrates 
indeed that 7c({(s, s') : F{c, s) = G{c, s')}) = 1 yielding that the dynamics are identical; but note that 
7c must depend on c. 

4.2. Other Undecidable Properties 

Of course, stochastic CA inherit many undecidable properties from deterministic CA since they are a 
generalization of them. However, with the stochastic formalism new global properties can be considered 
together with their associated decision problem. For instance, we say that a CA F is noisy if it may reach 
any configuration from any configuration, i.e. if A/i?(c) = Q^ for all c G Q'^. Surprisingly, this basic 
property is undecidable in dimension two. 

Proposition 4.6. It is undecidable to determine whether a given CA of dimension d ^ 2 is noisy. 



Proof: 

Using the construction of Theorem|4.1[ we show that the surjectivity problem reduces to testing noisiness. 



Hence it is undecidable starting from dimension two llTSl . D 

By contrast, testing whether a CA is deterministic or not is decidable in any dimension. 

Proposition 4.7. A CA F of local function f : Qp x Rf -^ Q is deterministic if and only if 
f {u,v) = f{u,v') for all u,v and v'. Hence testing whether a CA is deterministic is decidable in any 
dimension. 

Proof: 

If the condition on / is verified then clearly F is deterministic. Conversely, if there are some v and v' 
such that f{u, v) / f{u, v') then it is straightforward to construct infinite configurations c in Q^ and si 
and S2 in R^ such that F(c, si) ^ F{c, S2) contradicting determinism. D 

The two undecidable problems presented so far (equality of global maps and noisiness) were shown 
undecidable for general stochastic CA starting from dimension 2. Section [6] below shows how these 
problems become decidable when we restrict to one-dimensional CA. We now give an example of a 
basic problem which is undecidable starting from dimension 1. 

Pattern Probability Threshold Problem. Given a SCA F, a language L over Qp and a threshold 
function 1? : L — ;■ [0, 1], the problem is to determine whether there is an initial configuration c and a 
word u £ L such that the image of c by F matches in one step the word u at position with a probability 
above ^{u). We will typically ask -d to decrease exponentially with the length of u. The problem is 
parametrized by L and ?? and is too general without additional restriction. In the sequel we will focus on 
the following restriction of this problem where the patterns to be matched are of the form x ■ y^ ■ z and 
1!} is exponentially decreasing in the length of the pattern, called PPT: 

Definition 4.8. The problem PPT (Pattern Probability Threshold) is the following: 

Input: A stochastic CA F over a state set Q with |Q| ^ 4, three distinguished distinct symbols 

x,y, z £ Q, and a threshold function t9 : N — )■ [0, 1] 

Question: Is there a configuration c and n ^ 1 such that [Sf{c)){[x • y" • z]q) > ■d{n)l 
A threshold function t9 is computationally superexponential if: 

1. §{n) =w(A")forallA< 1; 

2. there is an algorithm that given any < ;U < 1 outputs a K such that for all n > K, {}{n) > ^" 

Theorem 4.9. 

1. The problem PPT is undecidable for a stochastic CA F when i?(n) = ^ ,^ ,„ , even when restricted 
to dimension 1 . 

2. If the threshold ^ is non-increasing and computationally superexponential and with a computable 
limit, then PPT is decidable for stochastic CA in dimension 1. 



The proof of this theorem reUes on the undecidabiUty of the existence of a word recognized with a 
probability higher than some fixed threshold (namely, ^ as we will see later on) by a given probabilistic 
finite automata A iflSl . The key is to encode into a stochastic CA the complete recognition process 
of the word written in the initial configuration c by ^ in one single step of the SCA, so that the word 
u = ci . . . c„ is recognized by A if and only if the pattern x ■ y"' ■ z appears in the image configuration 
at position 0. We will ensure that this happens with probability exactly Pr{n is recognized by ^}/|Q|" 
and thus the pattern x ■ y"' ■ z appears in the image configuration of c with probability at least 1/2|(5|" if 
and only if u is recognized by A with probability at least ^, proving the undecidability of PPT. 

Encoding probabilistic finite automata into CA, A probabilistic finite automaton A consists in a 
quintuple {Q, A, {Ma)aGA, I, ^) where: 

• ^ is the finite alphabet and Q the finite set of states; 

• / G Q is the initial state and F G Q the set of final states; 

• for each a G A, Ma G [0, 1]Q^*3 is a stochastic matrix giving the transition probabilities: 

- for each a and each gi, NJ Ma{qi,q2) = 1- 

92 

- Ma{qi, (72) is the probability to go from state qi to state g2 when reading letter a; 

An accepting path in the automaton for a word u G A" is a finite sequence {qi)o^i^n of states such 
that: 

• the first state go = ^ is the initial state of the automaton, 

• gn G J". 

The weight of a path for a word u, is the product of weights of the transitions of the path labeled by the 
succesive letters of u: 

W Mu^{qi_i,qi). 

The acceptance probability of a word u G Q" for A, P^ (u), is the sum of the weights of all accepting 
paths for n: 

q£Q^' 
qo = I, qn£ T 



in+l l^i^n 



Proposition 4.10. For any probabilistic finite automaton A with rational transition probabilities, state set 
Q and alphabet A, there exists a stochastic CA F of dimension 1 with states Qp = AU {O^, — )•, -/, _L} 
and random states Rp = Q x {1, . . . , m} (where m is the least comon multiple of the denominators of 
all transition probabilities) such that for all n ^ 1: 

1. (5f (c))([G>- ^- V]o) = if c [O • A" • y]o. 



2. {Sp{c)){[0^- ^" -yjo) = Fa (u) /\Qr if c G [G> • n • y]o with u e A^' 



Moreover, F can be obtained algorithmically from A. 



Proof: 

Informally, the random symbols will try to guess an accepting path for the word written in the initial 
configuration. Each random symbol will be our guess of the transition taking place in the automaton 
while reading the letter at that position in the initial configuration. Checking if this guess is correct 
can be done locally. It the path is indeed correct and accepting, then the SCA prints the configuration 
(3> — )•" n/; otherwise, it prints a _L at each place an error occurs. The exponential decay comes from 
the fact that the SCA needs to guess what is the state just before reading each letter, which implies a 
multiplicative shift of the success probability by l/jQI". 

Formally, let m be the least common multiple of the denominators of all transition probabilities and 
R = Q X {1, . . . , m}. Let r : A x Q x {1, . . . , m} — )■ Q an arbitrary function such that for all 
{q, q') G Q^, #{1 ^ i ^ m : T{a, q, i) = q'} = m ■ Ma{q, q') — it exists since all Ma{q, q') are integer 
multiples of 1/m and Yl,n'<^Q ^a{q, q') = 1 for all (a, q) ^ Ax Q. Intuitively, when a random symbol 
(g, i) is uniformly selected in i? at a cell with symbol a in the configuration, the SCA reads it as the 
guess that the transition that probabilistic automata follows when reading this letter a is g — > T{a, q, i) = 
q' , where q is chosen uniformly at random and q' is chosen with probability Ma{q,q'). Every given 
accepting path qo = I, . . . ,qn € J-" for a given word u G vl" written on the initial configuration is 
thus correctly guessed by the SCA with probability 1/|(5|" (for guessing the right sequence of states 
qi, . . . , qn) multiplied by fliLi ^"i (%-!> Qi) = "^A (^) for guessing the right transition at each letter. 
Let us now describe precisely the SCA. 

Next, define the local function / : qIt^'"^ x R'^-^'^^ ^ Qf of F as follows: 



f{c-iCo,{q^i,i-i){qo,io)) 

=s-l =so 



G> ifco = G>, 

— > if Co G A and qo = r(co, 7, i-i) with 7 

y if Co = n/ and g_i G -F, 

_ _L in every other case. 



I-i 



ifc_i = G>, 
otherwise 



By construction, F{c, s) G [Q^- —5-" ■y]o if and only if: 



1. cG [G> • ^" • y]o, and 



2. qi = t(ci,/, zo) andgfc = T{ck,qk^i,ik-i) ior 2 ^k ^ n, and 



3. qn G -F, 



where Sk = {qk,ik) for A: G N. Fix a configuration c G [(3> • ^" • ^/]o. and let n = ci . . . c„. Then, 

qi = t{ui,I, io) and g„ G J", and 



(5^(c))([0. ^" Vjo) = — ^ 



Qk = T{uk,qk-i,ik-i)ior2 ^ /c ^ n 



^ n— 1 

r^ X] #"L«0 : -r(ni,/,io) = ft} x JJ #{ifc : T{uk+i,qk,ik) = qk+i} 

1 "■ 

— Yl Ylm-Mu,{qk-uqk) 

((?o,gi,-,gn)e{/}xQ"-ixj-A:=i 



by definition of r, which concludes the proof. D 

Using classical undecidability results concerning acceptance threshold in probabihstic finite au- 
tomata we can now prove the Theorem stated earlier. 

Proof of the undecidability result in Theorem |4.9| 

In ifTSl . the following problem is shown undecidable: 

input: a probabilistic finite automaton with all transition probability in {0, ^, 1} 
question: is there a word accepted with probability at least j? 



Using Proposition 4.10 it is straightforward to check that this problem reduces to problem PPT. D 

Let us now show that if the threshold is superexponential in n, then one can decide PPT in dimen- 
sion 1. Assume that the threshold function i9 is non-increasing and superexponential (i.e. 'd{n) = ci;(A") 
for all A < 1). We will show the following structural lemma. Let us say that a word w G (5"+^~^ is 
(.-looping if w = ala for some a ^ Q^ and / G Q"' (recall that i = 2k + 1 where k is radius of the 
considered SCA). We say that a word u; G Q^ is non-deterministic for a SCA F if there are two random 
words s,s' G R^ such that f{w,s) / f{w,s'), and deterministic otherwise. By extension, we say a 
word w G Q""''^"^ is deterministic for F if all the subwords of length £ it contains are deterministic for 
F. 

rp 

To simplify notations, we denote by Prju — )• w} the probability SF{[v]-k)i[w]o) when v and w have 
appropriate lengths (\v\ = \w\ +2k). 

Lemma 4.11. If there are n G N and a word u G (5"+^+^ such that Ft{u — ;> xy^z} > "&{n), then there 

are n' G N and a word u' G Q" ^^^^ such that Pr{u' — )• xy"^ z} > 'd{n') with either 

• n' ^ K, or 

• n' = ^a{laY^' where |7| + |7'| ^ K,a ^ Q^, \l\ ^ \Q\^, q G N*, and the word aZa is deterministic 
forF. 



where i^ is a constant that can be algorithmically computed from (., \Q\, \R\ and -d. 
Proof: 

First note that by the pigeonhole principle, any word in Q* of length at least i. + \Q\^ contains a 
^-looping subword ala with a ^ Q^ and |^| ^ \Q\^- Let us isolate the centerpart of u by writing 
u = aba' where a, a' G Q^. Let us mark in h all the letters which are at the center of non-deterministic 
neighbourhoods. The marked letters split b into subwords where each letter belongs to a deterministic 
neighbourhood. 

Let us first assume that all of this deterministic subwords have length at most (. + \Q\^, then 
there are at least n/{(. + \Q\^) distinct letters at the center of non-deterministic neighbourhoods in b. 
Since neighbourhoods at distance at least i from each other evolve independently, and since every non- 
deterministic neighbourhood produces an error in the pattern with probability at least l/|i?|^, it follows 
that the image of n by F will be xy^z with probability at most (1 — l/|i?|^)"/^(^+IQI ) = ^" where 

// = (1 - l/|i?|^)i/^(^+IQI'). As Pr{ti 4- xy"z} > i9(n), it follows that ^u" < t?(n) but since i9 is 
superexponential and non-increasing, there is a constant K that can be algorithmically computed from 
IQI, |i?|,£ and -d (assuming an appropriate oracle for the superexponentiallity of -d) such that for all 
m> K, ■d{'m) > f/^. It follows that n ^ K and case 1 is verified for u' = u. 

Let us now assume that one of the deterministic (unmarked) subwords of b has length at least ^+ IQI^. 
it follows that it contains a ^-looping subword of length at most \Q\^. Note that one can strip or duplicate 
the loop la in any ^-looping subword ala in u: this will only affect the image (by adding or deleting some 
ys in the image) but will not change the probability of obtaining the pattern xy*z. We then strip in u all 
the loops in the ^-looping deterministic subwords but one and duplicate the remaining one as many times 
as necessary to obtain a word u' at least as long as u, i.e. so that u' has length £ + n' + 1 with n' ^ n. 
Since -d is non- increasing and since the probability to obtain xy" z from c' is identical to the probability 
to obtain xy^z from c, we have Pr{n' — )• xy"^ z} > 'd{n) ^ '9{n'). Now, u' has the form ^a{laY^' 
where ala is deterministic and all the deterministic subwords in 7 and 7' have length at most I + \Q\^- 
Using the same argument as before, the sum of the lengths of the two words 7 and 7' is bounded by the 
constant K and therefore u' has the desired properties for case 2. D 



Proof of the decidability part of Theorem 4.9 



Let 6 = lim„_5.oo i?(n). Consider a SCA F with state set Q, random symbol set R, radius k and neigh- 
borhood width (. = 2k + l. Let ;u = (1 - l/|i;|^)V<^(<^+IQI') and compute K such that d{m) > /x" 



for all m ^ K. According to Lemma 4.11 there are an n S N and a word u G Q""''^"''^ such that 



Pr{n — ;■ xy'^z} > '&{n) if and only if there is such a pair with n ^ i^ or there is a deterministic l- 
looping word ala of length at most £ + \Q\^ and two words 7, 7' of total length at most K and a (7 G N 
such that u = ^a{laY^' verifies the condition. One can easily check these conditions by enumerating all 
the subwords of length at most K and loops of length \Q\^, the only difficulty consists in computing the 
right value for q in the second case. This is achieved as follows: first compute the word u = ^ala-)' with 

I7I + It'I ^ ^ ^i^d V^\ ^ \Q\^ such that ala is deterministic for F and for which Prju — )• 2;yl"l~^'^~^z} 
is maximum. If Pr{u — )• xy'^'^^^^^z} ^ 6, then we conclude that the second possibility is not possible 
because pumping in the loop la does not change the probability to obtain the pattern xy*z since {la) is 

deterministic. If Pr{ii — )• xy'^'^^'^^^z} > 6, then using the monotonicity of {}, there must exist some 



q such that Pr{7a(/a)''7' — )• xy'^'^z} > 'd{nq) where Uq = I7I + |a| + (7 • |/a| + |7'| — 1 — £ (again 
because pumping in the deterministic loop la does not change probabilities). Then the second case is 
verified. D 

4.3. About the Garden of Eden Theorem 

One of the most celebrated theorems in the setting of deterministic CA states that surjectivity is equivalent 
to pre-injectivity (Garden-of-Eden Theorem, [4|). In the case of stochastic CA the situation is different 
as we will show in this section. 

Definition 4.12. Let F : Q^ x i?^ ^ Q^ be a stochastic CA. 

• F is surjective if for any c G Q^ there is some c' e Q^ and some s G E?' such that F{c' , s) = c. 

• F is injective if for any ci, C2 G Q^ and si, S2 G R^ we have 

F(C1, Sl) = F(C2, S2) =^ Cl=C2 

• F is pre-injective if for any ci , C2 G Q^ with finitely many differences and si , S2 G i?^ we have 

F(C1, Sl) = F(C2, S2) =^ Cl = C2 

Some remarks: 

1 . the definitions above agree with the classical deterministic setting when the stochastic CA consid- 
ered turns out to be deterministic. 

2. in general a stochastic system can be injective yet non-deterministic, for instance consider the 
following stochastic map / : [0, 1] — )• [0, 1] s.t. 

J 2;/3 with probability 1/2 

~ 1 1/2 + x/3 with probabihty 1/2 

3. it is easy to define some stochastic CA which is surjective but not injective (and not pre-injective), 
for instance F : Q^ x Q^ -> Q^ s.t. 

F{c,s) = s 

Contrary to the deterministic setting, there is no Garden-of-Eden Theorem for stochastic CA. How- 
ever, there aie still strong relationships between the notion defined above. 

Theorem 4.13. Let F be a stochastic CA, then we have: 

• if F is pre-injective then F is surjective; 

• if F is injective then F is deterministic. 



Proof: 

The proof is written for ID CA to simplify notations but it extends to any dimension in a straighforward 
way. For any n > and any word w G R"' we define the deterministic CA F^ : (Q")^ — ^ (Q")^ as the 
grouping by groups of size n of the map: 

cG Q^^ F{c,w) 

where w is the periodic configuration of period w (with wq on cell 0). 

First, since F is pre-injective, we have that, for any w, F^ is pre-injective (straightforward). Hence, 
by choosing some r € R, we deduce that Fr is surjective (by the Garden-of-Eden theorem). Therefore 
F is also surjective, which proves the first assertion of the theorem. 

Now suppose in addition that F is injective. Then for any n > and any wi,W2 S R^ we have 
F^j = Fw2- Indeed, if it was not the case we would have some c G {Q"")^ such that F^^ (c) 7^ F^,^ (c). 
But since F^^ is surjective (shown above) there would exist some c' such that F^^i^^') = Fwi{c)- Since 
c' must be different from c (because F^^ (c) / Fw2 (c)) this contradicts the injectivity of F (grouping is 
a bijective operation and does not affect injectivity). 

To conclude the proof it is sufficient to take n large enough (larger than 2k + 1 where k is the radius 
of F): in this case F^^ = F^j for any wi,W2 means that F does not depend on its /^-component, hence 
it is deterministic. D 

5. Correlation-free local rules are simpler 

A stochastic CA A = {Q, R,V,V' , f) is correltation-free if its neighborhood associated to the random 



component is trivial: V' = {0} (see Definition 3.11. Letting p = \V\, its local function / is then of 
the form f : Q^ x R ^ Q and it can be seen has a map Ff from Qf to probability distributions over Q 
(maps from Q to [0, 1] summing to 1) as follows: 

™, ^ #{ae R: f{qi,...,qp,a)=j} 

Ff{qi, ...,qp):-f^ — '- 

|/i| 

Note that most of the Uterature concerning stochastic CA is restricted to local Correlation-Free CA and 
use map P/ to define them (32 (Ell 13129111 • 

As an immediate consequence of this form of local function, one can compute probabilities involved 
in the global function as a product of 'local probabilities' as shown by the following lemma. To simplify 
notations, it is stated in dimension 1 but extends without difficulty to higher dimensions. 

Lemma 5.1. If F is a local Correlation-Free stochastic CA of local function / and radius k, we have for 
all configuration c and all finite words n: 

(5f(c)) (Mo) = Y\ (lP/(c^-fc, • • • , C^+k)) (Uz+l) 

Ot^z<\u\-l 

Proof: 

It is sufficient to check that the set £c,[u]o (^^^ section 3.3 1 is a union of cylinders which is in one-to-one 
correspondence with the set 

JJ {s : f{c;,-k,---,Cz+k,s) =Uz+i}. 

0^2<|n|-l 



The lemma follows by application of the uniform measure on both sets. D 

5.1. Impossible behaviors 

We now present behaviors than can be realized by general stochastic CA but not by CFCA. 

5.1.1. Number-conserving CA 

Number-conserving C A are regularly used to model interacting particles (see d [H] [121 for the case of 
deterministic CA). A classic example of interacting particles model is the usual random walk, which is 
number-conserving because the number of walkers is conserved. Again, we restrict to dimension 1 to 
simplify notations but the extension to higher dimension is straightforward. 

Definition 5.2. A SCA F is number-conserving if Q = {0 ... g} for some g G N and for any finite 
configuration c G Q^ (i.e. a configuration with finitely many cells in a state other than 0), we have 

(5Hc))({c':5]c', = ^q}) = 1 
i i 

where the infinite sums are well-defined because only finite configurations are considered. 

Note in particular that the definition implies Sf{^0^){{^0'^}) = 1 and more generally, when c is 
a finite configuration and c' is reachable from c then Sf{c){{c'}) > (because there are only finitely 
many configurations reachable from c). 

Remark that our definition requires the number to be conserved almost surely and is thus more 
restrictive than the definition in ITTBI , which requires only the number to be conserved in expectation. The 
conclusion of [13|, leaves open the question of strictly conservative particle system in CFCA. We settle 
this question by showing that there is no CFCA (nor powers of CFCA) that can simulate (surjective) 
conservative particle system. 

First, remark that it is easy to design a SCA that simulates a conservative particle system. For 
instance, consider the following SCA F with states {0, 1}, random symbols j-^, •, — )•}, and radius k = 2. 
The Is represent the particles and the Os the empty cells. The random symbol represents the movement 
each particle is trying to make: stay for •; move right for — )■ to be performed if the right cell is and if 
this move does not induce any conflict with another particle; move left for -^ to be performed if the left 
cell is and if this move does not induce any conflict with another particle. Here is its local rule (we 
only give the neighbourhoods whose image are 1, the others have image 0): 

I c : * * 1 * * \ 



* * 1 1 



10 10* 



The images of all other patterns are (* stands for an arbitrary symbol). 





This SCA is clearly non-deterministic, number-conserving and surjective (each cell remains un- 
changed when its random symbol is •)■ 

Note that most interacting particle systems are not only number-conserving but also surjective. It 
turns out that no CFCA nor iterates of CFCA can express such systems. 

Lemma 5.3. If a CFCA is number conserving, then it is a deterministic map. 

Proof: 

Assume by contradiction that F is a number conserving CFCA which is not deterministic. Let c be a 
finite configuration such that there are c' 7^ c" such that Sf{c){{c'}) > and Sp{c){{c"}) > 0. Let i 
be such that c[ 7^ c'/. Then, since the updates are independent in CFCA: with positive probability, c is 
mapped to c' and with positive probability, c is mapped to c' where c^ = c'- and c'-,^ = c' . This is a 
contradiction since the total weight of c' and c' differ by c[ — c'/ / 0. D 

Theorem 5.4. Let F be a CFCA. If Sp is number-conserving and surjective for some t > 0, then F is 
deterministic. 

Proof: 

Assume by contradiction that F is non-deterministic. By Lemma |5.3[ F is not number conserving. 
Therefore there are finite c, c', c" such that Sf{c){{c/}) > 0, Sf{c){{c"}) > 0, and the weight of c' and 
c" differ. As F* is surjective, so is F*^^ and let d be a finite configuration such that 5^^(d)({c}) > 0. 
It follows that Sp{d){{c'}) > and Sp{d){{c"}) > which contradicts the fact that F* is number 
conserving. D 

Whereas the surjectivity constraint was not needed in the lemma, it is required for the theorem to 
hold. Indeed, the square of the CFCA illustrated in Fig. [T] is non-deterministic and number-conserving. 
This CFCA F has neighbourhood {— 5, . . . , 6} (a neighbourhood large enough to prevent unstable pat- 
terns from propagating). The only patterns yielding to a state change are: 

/(OOOOOllOOO**) = /(**0000001000) = 1 

/(*0000011000*) = or 1 with probability ^ /(**0001001000) = 

A cell matching any other pattern remains unchanged (* stands for an arbitrary symbol). Note that this 
CFCA is not surjective since no image configuration contains the pattern 0000011000. 

5.1.2. Generation of random words of fixed parity 



Let us consider again the SCA Parity which was used as an example in Section 3.3 It turns out that 
even iterates of CFCA having Q = {#,0,1} cannot reproduce this behavior. 



Theorem 5.5. For all F a CFCA with Qp = {#, 0, 1} and for all t, we have {SpY / Sp, 



'arity- 



Proof: 

Assume by contradiction that there is a CFCA F and t such that {SpY = SparUy Let k be the radius 
of F. Both images of 0^^^+^ and l^^^+i cannot be deterministic. Otherwise, F*([0^'^*+^]_fct)o would be 



il 

n nmniD n "^ transition occurlng with probability 1/2 

...... — ► deterministic transition 

0... 000110.. .0 

UtIU 
0...000010...0 

0... 010010.. .0 
Figure 1. An example of non-surjective non-deterministic CFCA whose square is number conserving. 

a deterministic value which contradicts the random generation of words of even parity starting from the 
pattern :^0^'^*+^#. Therefore F(0^'''+^) is with probabihty p > 0, and 1 with probability I - p > 0. 
Then, with positive probability q, F*"i([#02^*+^#]_rt) ^ [O^^+^fc and then the central cell of 0^''+^ 
is mapped independently to or 1 with probability p and 1 — p. It follows that there is a probability at 
least inm{qp, q{l — p)) > that the word #0^^*"''^# is mapped by F* to an odd parity word. D 

5.2. Decidability 

Theorem 5.6. Let F be a CFCA. It is decidable to test whether it is noisy. 

Proof: 

It is enough to check the local rule. Indeed, as opposed to SCA, a CFCA is noisy if and only if every 
neighbourhood has a positive probability to be mapped to every letter. D 

CFCA do not have the same expressiveness as SCA. However, their squares can introduce correla- 



tions, and can in fact simulate any SCA (see Theorem 7.9 1. In particular, deciding the equality of the 



stochastic functions of squares of CFCA is undecidable, as proven bellow: 

Theorem 5.7. Fix any dimension d ^ 2. It is undecidable whether a given CFCA F of dimension d is 
such that F"^ is a white noise CA (i.e. S'^{c) is the uniform measure for all c). It is also undecidable 
whether F^ is noisy. 

Proof: 



As in Theorem 4.1 we reduce from undecidability of determining the surjectivity of CA in dimension 
2 and higher. Let F be an automaton over Z'^ with states Q. We define the CFCA G over Z'^ with 
states Q X Q and random states Q as follows: G{{c,c'),s) = {F{c'),s). Then, G^((c, c'), s, s') = 
G{{F{c'),s),s') = {F{s),s'). It follows that, as in Theorem 4.1 Sq is the uniform measure over 
{Q X Q)^ , i.e. it is white noise, if and only if F is surjective. 

The same encoding shows undecidability of deciding if the square of CFCA is noisy because G^ is 
either white noise or non-noisy, whether F is surjective or not. D 

Corollary 5.8. It is undecidable to determine, given F and G two CFCA, whether Sp = Sq. 

Proof: 



From Theorem 5.7 is is sufficient to consider particular instances of the problem where G a fixed CFCA 



which is noisy. D 



6. Dimension 1 is simpler 

In this section, we restrict to dimension 1. 

6.1. Weighted De Bruijn Automata 

To any SCA F, syntactically given by (Q, R, V, V, /), we associate a weighted finite automaton Ap = 
(S, A, 6, w, iQ,F) whose weight are in Q (see EOl for an introduction to weighted automata). Intuitively, 
Af is a De Bruijn automaton recognizing pairs of configurations of the form {c,a^^^ o F(c, •)) and 
the weights correspond to probability distributions given by the stochastic global function Sp- More 
precisely, the shift of A: — 1 cells between the two components of the recognized pairs comes from the 
internal memory of the automaton which needs to be initialized as detailed below. 

First, we suppose without loss of generality that the neighbourhoods are of the form V = V' = 
{—k, . . . , k}. We let i = 2k + 1. Af works on alphabet Tj = QU Q x Q and its set of states is 

A= U Q^ X W 

By convention, we denote iq = (e, e) the only element of Q^ x R^. The transition relation 6 (^ AxT,xA 
is given by: 

• (initialization) for any j < £ — I, any state (a, b) G Q^ x R^ , any q £ Q, and any r (^ Rwe have 

{(a,b),q,{aq,br)) G S 

• (main component) for any ^-uples qi,. . . ,q£ G Q and ri, . . . , r^ G i?, we let a = [qi, f{q,r)) 
and we have 

{{qi---qe~i,ri---ri-i),a,{q2---q£,r2---ri)) G 6 

Finally, we let J^ = ^ \ {io} and weights are tw for all transitions. 

A path in Af is a sequence of transitions starting from iq and ending in T, such that any transition 
starts from the state where the previous one arrives. The word recognized by a path is the sequence of 
labels of transitions. The weight of a path is the product of the weights of transitions. The weight of a 
word u G S* , denoted u;_4^ (n) , is if n is not recognized by any path, and the sum of weights of paths 
which recognize u otherwise. 

By construction, the automaton Af recognizes only words of the form Q^^^{Q x Q) ■ All transi- 
tions have the same weight. But, two recognized words of same length do not have the same weight in 
general. 

Lemma 6.1. Let u ^ Q^ and v G Q^~^^~^ for some p ^ 1. Define m = vi- ■ ■ W£_i • 

(f£, ui) • • • (fp+£-i, Up). Then, for any configuration c G [w]o, we have wji^^ (m) = (^^(c)) ([m]^)- 

Proof: 

It is straightforward to check from the defintion of Af that the set of paths recognizing m is in one-to- 
one correspondance with the set of word u G RP+^-^ such that F(c, [v]q) ^ [u]k- Moreover, this set 



is exactly £c,[u\k (^^^ section 3.3 1 and the measure of this set for the uniform probability measure is by 
definition (SF{c)){[u]k)- Finally, since each path of length I has weight r^r, the weight wji^^ (m) of m 
is equal to the measure of £c,[u]h ^^^ "^^e uniform probability measure. The lemma follows. D 



Corollary 6.2. Equality of stochastic global function of ID CA is decidable. 

Proof: 

Consider two CA F and G. We can suppose that they have the same centered neighbourhood V 



{—k, ■ ■ ■ , A;} (if not simply increase syntactically neighbourhood apropriately). From lemma 6.1 it fol- 
lows that equality of Sp and Sq is equivalent to the equality of U7yi^ and wj(^, i.e. equality of the 
weighted languages of Ap and Ac- The problem of equivalence of weighted finite automata is decid- 
able for weights in Q 1201 . Since Ap and Ac are computable from F and G, the corollary follows. D 

6.2. Simplified automaton for Correlation-Free CA 

Consider a CFCA F , with A = {Q, R, V, {0}, /). The construction detailed in the above subsection 
gives a weighted automaton Ap = {^, A,S,w,io,T) which has some additional regularities due to 
correlation-freeness. Intuitively, memorizing the i^-component in states of the automaton is useless. 
We now construct a deterministic weighted finite automaton Bp = (S, B, 5' , w' , ig, T') which is equiv- 
alent to Ap. Bp is essentially a De Bruijn graph with an initialisation part. The weights are given by the 
map Ff associated to the local function / of F. Formally, Bp is defined as follows (again with £ = 2k + l 
and S = Q U Q X (5): 

• B= [J Q^; 

• ig = e (the single element of Q") and F = Q^~^; 

• w' : B X Ti X B ^ [0,1] gives implicitly 6' (all non-zero weight transitions) and is defined by: 

- for any i < £ — 1, any a G Q\ any q € Q we have: 

w'{a,q,{aq)) = 1; 

- for any qi, . . . ,q£ e Q we have: 

w'{{qi, ..., qe-i), [qi, q), {q2, • . . , qe)) = (P/(gi, . . . , qe)) (q) 

- any transition not mentioned above has weight 0. 

We define w^p : S* — )■ [0, 1] as just the product of weights of the transitions of the unique path 
labelled by u. Like Ap, Bp can be used to compute the probabilities involved in Sp. 

Lemma 6.3. Let u G Q^ and v € QP^^'^ for some p ^ 1. Define 

m = vi- ■ -Vi-i ■ {vi,ui) ■ ■ ■ {vp^f-i,Up). Then, for any configuration c G [v]o, we have 
WBirim) = {Sp{c))i[u]k). 

Proof: 

Straightforward from the construction of Bp and lemma 5.1 there is only one path in Bp recognizing m 
and, after the initialization part (f i • • • V£_i) of weight 1, the weight of the path is given exactly by the 
product of P J ( • ) appearing in lemma 5.1 D 



Armed with this construction, we will now reevalute the PPT problem in the special case of CFCA. 



Unlike in the general case of Theorem 4.9 the problem will turn out to be decidable. First, let us intro- 
duce some vocabulary and a structural lemma. 

We say that a word m = vi---V£^i • {v£,ui) ■ ■ ■ {v£^p^i,ui^p^i) of S* is valid if ui = 07-, 
«£+! = • • • = ue-^-p^2 = — ^. Ui-^-p-l = -/, and WBp{m) > 0. We say that a word is short if its 
length is at most £ — 1 + \Q\^~^ + 3, i.e. the number of states of the automaton plus three. We 
say that a word m = vi-- ■Vi-i{v£,ui) • • • {v£^q-i,Uq) e S* contains a loop of length k at po- 
sition i, with £ ^ i ^ \7n\ — k, if Vi-£^j = Vi^i^_£^j for all 1 ^ j ^ £ — 1; in this case, 
we say that the word / = {vi,Ui-£-^-l) ■ ■ ■ (uj+K_i,Ui_£_|_K) is a loop of m at position i. The quan- 
tity {wBpivi+^-£+i ■ ■■Vi+^-i{vi,Ui_£+i) ■ ■ ■ (t;i+K_i,Ui_£+K))) is conveniently called the linear 
weight lwBp{l) of the loop I. Lastly, we say that a loop / = {vi,ui) ■ ■ ■ {vk,Uk) is a valid loop if 
til = • • • = Wk = — ^- Note that if a valid word contains a loop / at position i with £ -\-l < i < \m\ — \l\, 
then for all g S N, the word w.i..j_i/''mj+|;| |„| is valid and has weight WBp{m) x /t(;e^(/)(''~^)l'L 

Lemma 6.4. Assume there is a valid word m', of length £ + p' — 1, having weight 
WBp{rn') > aXP for some a > and A < 1. Then there must also be a valid word 

m = f 1 • • • V£-i ■ {v£, U£) ■ ■ ■ (v£+p+i, u^+p-i), of length £ + p + 1, with weight WBp{m) > aX^ and 
such that at least one of the two following properties holds: 

• ?7i is short. 

• m consists in three parts: m = mil'^m2 where 

- mi = vi • • • V£-i ■ {v£, U£)--- (f^+a-i, U£+a-i) with 2 ^ a ^ \Qf~^ + 2 

- g G N and / is a valid loop of length at most IQI^^^ and with linear weight > A, and 

-1712 = {v£+p-b,U£+p-b) ■ ■ ■ {v£+p-i,ui+p^i) with 1^6^ \Qf'^- 

Proof of Lemma 1131 

Consider a vahd word m' = vi ■ ■ ■ V£-i ■ {v£, Ui) ■ ■ ■ (wp'+^_i, Up/^^-i) with weight WBp{m') > aX^ . 
If m' is short, take m = m'. 

Otherwise, consider the subword n' = {v2+£, U2+e) • • • (vp'+£-2) ^p'+^-2). of length p' — 3. Since m' is 
not short, p' +£ — 1 '^ £ — l + \Q\^~^ + 3, hence p' — 3 ^ | Q | ^~ ^ , therefore n' must contain some loop /' of 
length k'. Since m' is valid we have U£ = (3>. ui+£ =— )-, Up/^£-i = •/, and the rest of the u's are equal 
to — ;•, i.e. those of n' are equal to — )• and so the loop is valid. We construct m" by recursively removing 
all loops I' that have linear weight ^ A. The weight of m" = vi- ■ ■ ve-i • {v£, up) • ■ ■ (fp"-t-£_i, iip"+^_i) 
verifies WBp {m") > A^ as well, and m" is valid. If m" is short, take m = m". 

Otherwise, consider the subword n" = {v2+e, U2+£) ■ ■ ■ {vp"+£-2, «p"+£_2) of m", with length p" — 3. 
Again n" must contain some valid loop, and all its loops have hnear weight > A. We construct m from 
m" by recursively removing all but one loop, which we call I. This rh is again valid, and indeed of the 
form milm2 with nii and ?n,2 as specified in the lemma. Finally, since the linear weight of / is > A, 
one can choose (7 G N so that the weight of the valid word m = mil'^m2, of length £ — 1 + p, verifies 
WBpirn) > aXP. D 

The decidability result follows directly from this lemma. 



Theorem 6.5. The problem PPT is decidable for CFCA of dimension 1 when the threshold function ■§ 
verifies ■d{n) = aA" for some a > and A < 1 which may depends on the CFCA. 

Proof: 

We can decide PPT on input F by the following algorithm: 

1. Check if there is a short valid word m verifying WT3p{m) > aA'™'"^"^; if one is found answer 
YES; 

2. If none is found, check if there is a valid word m = milm2 with 2 ^ \mi\ ^ |Q|^~^ + 2, 
1 ^ |"i2| ^ IQI^ ^> and where I is a valid loop of length at most |(5|^^^ and with linear weight 
> A; if one is found answer YES else answer NO. 

D 

6.3. Non-deterministic global function and model checking 

In this section we will briefly show how model checking methods based on Biichi automata and already 
developped for ID deterministic CA ll30ll can be extended to non-deterministic automata. It is not the 
purpose of our paper to give a self-contained exposition of this topic and we refer the reader to HBTI ITOl 
for more details and extensions of this approach. 

The central concept here is that of uj-automatic structure. Intuitively it is a structure where objects 
are (semi-)infinite words over a finite alphabet A and relations, seen as languages of (semi-)infinite words 
over Ax A, we. all Biichi -recognizable. More generally, one can consider structures which are isomorphic 
to one of this form in order to allow other types of objects. In our case, objects are (bi-)infinite words 
and we consider the bijection : A^ -^ {Ax A)^ (for any finite set A) defined by: 

4){c) = n I— )■ {c{n), c{—nj) . 

Definition 6.6. A structure (^4^, {Ri)i^i^n), where each Ri is a relation of finite arity ki over A^, is 
u-automatic if for each i the following relation is Biichi-recognizable: 

{((/)(ci),...,</>(cfcj) : (ci,...,CfcJ G Ri}. 

Such a structure is finitely representable by the list of Biichi automata recognizing the relations. 

This definition is a particular case used in [ 10] of a more general notion introduced in [ 17 ] and further 
developped since then [21]. The interest of w-automatic structure relies in their decidability. Strangely 
enough, this decidability result is not always stated in its uniform version which is the most useful in 
the context of cellular automata (while 1 10] is not interested in uniformity, ll30l does not state uniform 
results but all the ingredients of uniformity are present in the proofs). 

Theorem 6.7. The first-order theory of w-automatic structures is uniformly decidable, i.e. there is an 
algorithm that take has input an a;-automatic structure, a first-order formula and decides whether the 
structure satisfies the formula. 



Proof: 

Although not stated exactly in this way, all details for the proof of this result are in iflTll . which is to 
our knowledge the first paper to introduce explicitely the notion of a;-automatic structure. In fact, all the 
tools needed to prove this result (essentially closure properties of Biichi automata and decidability of the 
emptiness problem for recognized language) were already present in the seminal work of Biichi in the 
60s. D 

We can now state a number of decidability results. Some of them can be compared with undecidabil- 
ity results in dimension 2 and the last two are 'unquantified' variants of the PPT problem which is also 



undecidable (see section 4.2 1. 



Corollary 6.8. The following problems are decidable for ID stochastic CA: 

1. given F, is it noisy? 

2. given F, is it surjective? 

3. given F, is it injective? 

4. given F, is it pre-injective? 

5. given F and G, do we have J\fp = MqI (where t is any fixed positive integer) 

6. given F, is there u ^ L and c such that A/i?(c) C [n]o? (where L is any fixed rational language) 

7. given F, is there u e L and c such that A/f(c) n [u]o / 0? (where L is any fixed rational lan- 
guage) 

Proof sketch: 

First, the following relations are Biichi-recognizable through encoding by (p (see |[30l [311 for detailed 
proofs about all of them except the last one): 

• X = y, 

• X ^ y, i.e. a; = y up to a finite number of differences, 

• X — T-i? y, i.e. y G Mf{x), 

• X -^ L, i.e. X G [u\o for some u £ L. 

Then all problems above can be expressed as a first-order formula in the apropriate w-automatic structure: 

1. \/y,\/x : X — T-i? y 

2. \/y, 3x : x -^p y 

3. Vy , \fxi , Vx2 : {xi —^-p y A X2 —^f v) ^ xi = X2 

4. Vy , Vxi , yx2 : {xi -^p y A X2 -^f y) =^ a^i ~ X2 



5. \/x,yy : {3zi,...,3zk-i,x^F A A ••• A Zk-i -^f v) <^ (3zi, . . . ,3zfc-i,x -^g zi A ■ ■ ■ A Zk-i -^g v) 

6. 3x, Vy : x -^p V =^ U "^ L 

7. 3x, 3y : x -^p y Ay -^ L 

Therefore, deciding one of the problem boils down to building the apropriate c<j-automatic structure and 



applying the algorithm of Theorem 6.7 D 



7. Intrinsic Simulation 

The purpose of this section is to give a precise meaning to the sentence "A is able to simulate B" or 
equivalently "A contains the behavior of B". 

Our approach follows a series of works on simulations between classical deterministic CA ||28l |23l 
|32l|5l|6l. We are going to define simulation pre-orders on stochastic CA which extend the simulation 
pre-orders defined over classical deterministic CA in 161 . More precisely, we want the new pre-order 
to be exactly the classical pre-order when restricted to deterministic CA. For general background and 
motivation behind this simulation pre-order approach we refer to miU. Intrinsic simulation has also 
been brought to deterministic quantum CA in |I21. 

In each case (the deterministic, the non-deterministic, and the stochastic global functions), we will 
define simulation as an equality of dynamics up to some local transformations. 

In this section and the next one, all CA considered are one-dimensional. There is no difficulty to 
extend all definitions and results to higher dimensions. 

7.1. Transformations 

The transformations we consider are natural stochastic extensions of the transformation defined in mm 
for the classical deterministic CA. These transformations can be divided into two categories: trimming 
operations which allow to trim unwanted parts off the dynamics, and rescaling transformations which 
augment the set of states and/or the neighborhoods. 

7,1.1. Trimming operations 

They are based on three ingredients: 1) renaming states; 2) restricting to a stable subset of states; and 
3) merging compatible states. These ingredients are synthetized into two definitions (state renaming is 
implicit in both definitions). 

Detinition 7.1. Let A = {Q, R, V, V, f) be a stochastic CA. 

• Hi : Q' ^ Q is an injective function such that Y = {i{Q'))^ is F-stable (i.e. F{Y, R^) C Y) 
then the i-restriction of A is the stochastic CA: 

iA={Q',R,V,V',if) 

where if is the local function associated with the explicit global function iF such that, Vc G /, 

Vs G R^, iF{c, s) = I^^ o F{I{c),s) where / : Q'^ — ^ Q^ denotes the cell-by-cell extension of 

i; 



• if vr : (5 — > Q' is surjective and F-compatible (s.t. Yl o F{c, s) = 11 o F{c' , s) for all s and all 
c,c' such that n(c) = n(c'), where 11 : Q^ — )■ Q'^ is the cell-by-cell extension of tt), then the 
n-projection of ^ is the stochastic CA: 

^A=iQ',R,V,V',^f) 

where '^f is the local function associated with the explicit global function ^F such that ^F{c' , s) = 
n o F{c, s) where c is any configuration in 11^^ (c'). 

Hi : Q' ^ Q and tt : Q' — )■ Q" verify the required stability and compatibility conditions, we denote 
by '^A the 7r-projection of the z-restriction of A. 

Definition 7.2. Let Ai = {Qi,Ri, Vi, V(, /i) and A2 = {Q2, R2, V2, V2, ^2) be two arbitrary stochastic 
CA. We define the following relations: 

s 

• ^1 !^ ^2, -4i is a stochastic subautomaton of A2, if there is some i-restriction of A2 such that 

S 

• Ai<A2,Aiis£L stochastic factor of A2, if there is some vr-projection of A2 such that Sp^ = S-^f2 '■• 

N N D D 

Similarly, we define C and < (for non-deterministic global maps) and C and < (for deterministic global 

D N S 

maps). We also define the three relations ^, <n and ^ using projections of restrictions. For instance: 

s 
Ai ^^2 if there are i and vr such that Sp^ = S^rp^. 

7.1.2. Reseating transformations. 

The transformations defined so far only allow to derive a finite number of CA from a given CA (up to 
renaming of the states) and thus induce only a finite number of dynamics. In particular, the size of the set 
of states, and the size of the neighborhood, can only decrease. Following the approach taken for classical 
deterministic CA, we now consider rescaling transformations, which allow to increase the set of states, 
the neighborhood, etc. Rescaling transformations consist in: composing with a fixed translation, packing 
cells into fixed-size blocks, and iterating the rule a fixed number of times. Notice that since stochastic 
CA are composable, they are stable under rescaling operations, whereas CFCA are not. 

The translation Oz (for z G Z) is the deterministic CA whose deterministic global function verifies: 
Vc,Vz',P^^(c)2' = c^/+2. 

Given any finite set S and any m ^ 1, we define the bijective packing map bm '■ S^ ^ (S"") by 
bm{c)z = (cmz, Cmz+1, ■■■, Cmz+m~i) for all c and z. 

Definition 7.3. Let A = {Q, R, V, V, f) be any stochastic CA. Let m,t ^ I and z G Z. The rescaling 
of A with parameters (m, t, z) is the stochastic CA ^<™.*.^) = (Q™, (i?™)*, y+, F|, /('».*'^>) whose 
explicit global function F^™'*'^^ is defined by: 

F<-'*'^)(c,.) = 5„oa,oF*(6„i(c),6„i(si),...,6-,i(s*)) 

where s^, . . . , s* G (i?™)^ are the t components of s (s.t. s\ = (sj)i), and V"_|_, F|_ the modified neigh- 
bourhoods following hm- 



7.2. Simulation Pre-Orders 

We can now define the general simulation relations. 



Definition 7.4. For each local relation ^ among the nine relations of Definition 7.2 we define the asso- 
ciated simulation relation =^ by 

We therefore define nine simulation relations ^f , =^f , ^'^, ^f, ^^ , =^^, ^P, =^^ and =^^, where the 
subscript denotes the kind of local relation used (injection, Tirojection or mixed) and the superscript de- 
notes the kind of global functions which are compared (Stochastic, Non-deterministic or Deterministic). 

Lemma 7.5. A restriction (resp. projection) of a restriction (resp. projection) of some stochastic CA A 
is a restriction (resp. projection) of A. Moreover, any restriction of a projection of A is the projection of 
some restriction of A. 

Proof: 

This is a straightforward generalization of the corresponding result in the classical deterministic settings. 
A detailed proof for the deterministic case appears in Theorem 2.1 of [6|. All arguments given in the 
proof are easily adaptable to our setting. D 

The lemma above implies that any sequence of admissible restrictions and projections can be ex- 
pressed as the projection of some restriction. 



From Lemma |7.5| it follows that all local relations defined are transitive and reflexive. Moreover, 

D D 

the deterministic relations C and < are exactly the same as those defined in the classical setting of 
deterministic CA |6l . 

Fact 7.6. All simulation relations ^f, ^f, ^^, ^f, ^^ , ^^, ^f , =^^, ^^ are pre-orders. 

Proof: 

It is sufficient to verify that for any local comparison relation ^ : 

1. ^ is compatible with rescalings, i.e. 

2. rescalings are commutative with respect to ^, i.e. 

Both properties are straightforward from the definitions. Then, the transitivity of any simulation relation 
follows from the transitivity of the corresponding local comparison relation ^. D 

Each stochastic pre-order is a refinement of the corresponding non-deterministic pre-order as shown 
by the following fact (straightforward corollary of Fact[34|). 



Fact 7.7. If Ai^fA2 then ^i^^^2- The same is true for pre-orders ^f , =^^ and the corresponding 
(non-)deterministic pre-orders. 

Note that for any simulation relation =^, .4i =^ ^2 means that two global functions are equal where 
one is obtained by applying only space-time-diagram-preserving rescaling transformations to Ai (the 
simulated CA) and the other is obtained by applying both rescaling transformations and trimming oper- 
ations to A2 (the simulator). 

7.3. Classifications of Stochastic Cellular Automata 

Simulation pre-orders can be seen as a tool to classify the behaviors of CA ||5j|6l. They can be used to 
formalize in a more precise way the empirical classes defined historically through experimentations. We 
now give some results on the structure induced on stochastic CA by this classification. 

Ideals. Some classes of stochastic CA may only simulate CA of their own class. This is the case of the 
deterministic CA and also of the class of the noisy CA which are the CA F such that Mric) = Q^ for 
allc. 

Fact 7.8. Let ^ be any non-deterministic or stochastic pre-order. Let Ai and A2 be stochastic CA such 
that Ai ^ ^2- If A2 is deterministic (resp. noisy) then Ai is deterministic (resp. noisy). 

Proof: 



By Fact 7.7 it is sufficient to prove this for non-deterministic simulations. The property that the explicit 
global function is deterministic or noisy (i.e. surjective on each configuration) is preserved by rescaling 
transformation. Hence it is sufficient to check that being deterministic or noisy is preserved by restriction 
and projection. This is straightforward for projection (because a projection is an onto map). Determinism 
is clearly preserved by restriction. Moreover, a noisy stochastic CA does not admit any non-trivial 
restriction because no subset of states is stable under iteration. Hence, the restriction of a noisy CA is 
necessarily itself (up to renaming of states) or the trivial CA with only one state. Both are noisy and the 
fact follows. D 

Simulation of stochastic CA by a CFCA. Even if some stochastic CA cannot be expressed as a CFCA 
(because of potential local probabilistic correlations), each can be simulated by a particular CFCA. 

Theorem 7.9. For any stochastic CA ^ = {Q, R, V, V, /a) there is a CFCA B such that A=4fB. 

Proof: 

The idea is to simulate one step of A by two steps of B: 

1. generate a random symbol locally and copy it to a component of states; 

2. simulate a stochastic transition of A reading states only and ignoring random symbols. 

Formally, let B = {Qb, R, V-, VjIb) where Qb = QU Q x R and Jb is any local function such that 
the associated explicit global function Fb verifies: 



1. for any c G Q^ C Q| and any s G R^, {Fb{c, s)) __ = (c^, s^) 

2. for any c G (Q x i?)^ C Q^ and any s G R^, Fb{c, s) = Fa{t^q{c)^t:r{c)) where vtq and vr/j 

are cell-by-cell projections on Q and R respectively. 

g 
It is straightforward to check that AQ B^ with the restriction induced by the identity injection 

Note that the restriction is essential in the above construction since the behavior is not specified (and 
no correct behavior can be specified) on configurations where states of type Q and states of type Q x R 
are mixed. In particular it is false that the stochastic CA is the square of the CFCA; it is a restriction of 
that. 

Still, one could think that we might achieve a simpler simulation by taking Qb = Q x R and doing 
the two steps simultaneously so that Fb{c, s) would be the cell by cell product of F/i(7rQ(c), 7r^(c)) and 
s. But this does not work: for such a B there is generally no restriction nor projection nor combination 
of both able to reproduce the stochastic global function of A. Indeed, if some c and si, S2 are such that 
Fa{c, Si) / Fa{c, S2) there is no valid way to define a corresponding configuration for c in Fb because 
the Q-component of states in Fb depends only on the previous deterministic configuration, not on the 
random configuration. Then, one might see this impossibility as an argument against our formalism of 
simulation. Of course, many extensions of our definitions might be considered to allow more simulations 
between stochastic CA. However, we think that the random component of the simulated CA should 
never be used to determine which deterministic configuration of the simulator CA corresponds to which 
deterministic configuration of the simulated CA. Doing so would be like predicting the noise of a system 
to prepare the state of another system. In particular, we do not see any reasonable formal setting where 
Fb defined as above would be able to simulate F4. Fa and Fb might look like two syntactical variants 
of essentially the same object, but, as stochastic dynamical systems, they are very different. For instance, 
not every configuration can be reached from any configuration in Fb whereas Fa could have this property 
(i.e. be a noisy stochastic CA). 

We believe that a better understanding of the relationship between stochastic CA and CFCA should 
go through the following questions: Is there a CFCA in any equivalence class induced by the pre-order 
^f ? Is any stochastic CA =^f -simulated by some CFCA? 

8. Universality 

The quest for universal CA is as old as the model itself. Intrinsic universality has also a long story as 
reported in |[25l . Our formalism of simulation allows to extend the quest to stochastic cellular automata. 
Indeed, one of the main by-products of each simulation pre-order defined above is a notion of intrinsic 
universality. Formally, given some simulation pre-order =^, a stochastic CA A is ^-universal if for any 
stochastic CA B we have B ^ A. When considering deterministic pre-orders, we recover the notions of 
universality already studied in the literature for classical deterministic CA Il24ll26l l6l. 

8.1. Negative results 

When considering non-deterministic or stochastic global functions, the random symbols are hidden. Still, 
the choice of the set of random symbols plays an important role in the global functions we can possibly 



obtain. We denote by J'3~(n) the set of the prime factors of n. By extension, for a stochastic CA A with 
set of random symbols R, we denote by 73^ {A) the set J'3'(|i?|). We have the following result: 

Lemma 8.1. Let ^i = (Q, Ri,Vi, V(, /i) and A2 = {Q, R2, V2, V2, f2) be two stochastic CA with 
same set of states. If they are not deterministic and Sp^ = Sf2 then 'S''J{Ai) fl 1PJ(^2) / 0- 

Proof: 

If Ai is not deterministic, then there must exist some configuration c G Q^ and two configurations 
y ^ y' such that {y,y'} ^ Mpiic). So there are two disjoint cylinders [u\z n [u']^ = with y € [u\z 
and y' G [u']z- Therefore < {^Sf{c)) {[u]z) < 1. Besides, by definition of 5^, we have 

for some relatively prime numbers p and q (recall that ui is the uniform measure over Rf and that 
£;! j^j^ ={seRf : Fi(c, s) G [u]z}). Moreover, TJ(g) C ^JdiJi]) = 'PS'iAi) since f J [„]^ is a finite 
union of cylinders and since the z^i -measure of any cylinder is a rational of the form j^^ for some 
integers a,b^ 1. Now, by hypothesis, we have also 

and by a similar argument as above we deduce that 'J'3^{q) C y3"(^2)- The lemma follows since J'3"(g) / 
(because |J < 1). □ 



From Lemma 8.1 it follows, surprisingly perhaps, that the random symbols of a stochastic CA limit 



its simulation power to stochastic CA that have compatible random symbols. 

Theorem 8.2. Let s^ be any stochastic simulation pre-order, and ^1 and A2 two stochastic CA which 
are not deterministic. If ^1 =^ A2 then 'P3'{Ai) n 'PJ{A2) / 0- 

Proof: 

Trimming operations (restrictions and projections) do not modify the set of random symbols. Rescaling 
transformations modify the set of random symbols in the following way: R t-^ R^ for some integer 
n. Therefore such transformations preserve the set of prime factors J'9"(^) of the considered CA A. 
Moreover, rescaling transformations do not affect determinism: the rescaled version of a CA which is 
not deterministic cannot be deterministic. Hence, the relation ^1 =^ A2 implies an equality of stochastic 
global functions of two CA which have the same prime factors as Ai and A2, one of which is not 
deterministic. Therefore none of them is deterministic and the theorem follows from lemma [8?T] D 

The consequence in terms of universality is immediate and breaks our hopes for a stochastic univer- 
sality construction. 

Corollary 8.3. Let s^ be any stochastic simulation pre-order. There is no =<; -universal stochastic CA. 



8.2. Positive results 

Still, the negative result of Corollary [83] leaves open the possibility of partial universality constructions. 
We will now describe how to construct a stochastic CA which is ^^-universal (hence also =^^-universal; 
however note that the existence of a ^^- or even of a =^^-universal is still open), and then draw the 
consequences. 

Since we are not concerned with size optimization, we will use simple construction techniques using 
parallel Turing heads and table lookup as described for classical deterministic CA in ||25l . More precisely, 
we construct a stochastic CA U = {Qu, Ru, Vu, ^1/: fu) able to ^^-simulate any stochastic CA A = 
{Q, R, V, V , /) with no rescaling transformation on A and no shift in the rescaling ofU. Therefore each 
cell of A will be simulated by a block of m cells of U and each step of A will be simulated by t steps of 
U (t and m depend on A and are to be determined later). 

The blocks of m cells have the following structure (the restriction in the pre-order handles the trimming 
of any invalid block): 



SYNC transition table Q-state i?-symbol Q-states of neighbors R-symbols of neighbors 



where each part uses a fixed alphabet (independent of Q and R) and only the width of each part may 
depend on A. To each such block is attached a Turing head which will repeat cyclically a sequence of 
4 steps (sub-routines) described below. On a complete configuration made of such blocks there will be 
infinitely many such heads (one per block) executing these steps in parallel. Execution is synchronized 
at the end of each step (SYNC part) and such that two Turing heads never colhde. More precisely, for 
some steps (2 and 4) the moves of all heads are rigorously identical (hence synchronous and without 
head collision). For some other steps (1 and 3), the sequence of moves of each head depends on the 
content of their corresponding block but these steps are always such that the head does not go outside 
the block (hence no risk of head collision) and they are synchronized at the end by the SYNC part which 
implements a small time countdown initialized to the maximum time needed to complete the step in the 
worst case. The parts holding i?-symbols are initially empty (uniformly equal to some symbol) for each 
block. The 4 steps are as follows: 

1. generate a string representing a random i?-symbol in the i?-symbol part using (possibly several) 
random i?t/-symbols present in that part of the block; 

2. copy the i?-symbol part to the appropriate position in the i2-symbols of neighbors part of each 
neighboring block. Do the same for Q-state; 

3. using information about Q-states and i?-symbols in the block, find the corresponding entry in the 
transition table and update the Q-States part of the block accordingly; 

4. clean i?-symbol and i?-symbols of neighbors parts (i.e. write some uniform symbol every- 
where). 

This construction scheme is very similar to the one used for classical deterministic CA but two points 
are important in our context: 

• step 4 is here to ensure that each configuration of A has a canonical corresponding configuration 
ofU made of blocks where the parts holding i?-symbols is clean (i.e., step 4 is required for the 
existence of the injection i); 



• depending on the way we generate strings representing an i?-symbols from strings of i?;7-symbols 



in step 1, we will obtain or not a uniform distribution over R (recall Theorem 8.2 1. 



In the general case, we can always fix (by the means of the injection i) a width large enough for the 
i?-symbols parts containing so that all iZ-symbols can be obtained (but with possibly different probabil- 
ities). We therefore obtain a universahty result for non-deterministic simulations. 

Theorem 8.4. Let =<( be either ^f or ^^. There exists a =^-universal CA. 

Note that this =<( -universal CA is a CFCA, and we obtain thus a stronger version of the simulation 



mentioned in Section |73] page \S0\ 

Now, if we are in a case where 'J'3^{A) C 'P3^[U) then it is possible to choose a generation process in 
step 1 such that each i?-symbol is generated with the same probability. We therefore obtain an optimal 
partial universality construction for stochastic simulations. 



Theorem 8.5. Let =<( be either ^f or ^^. For any finite set P of prime numbers, there is a stochastic 
CA Up such that for any stochastic CA A: 'J'3'{A) ^ P =;. A^Up. Moreover, Up is a CFCA. 



9. Recap of Some Decidability Results 

Problem General Correlation-Free 



ID 



Correlation-free ID 



F deterministic 


Decidable 


Decidable 


Decidable 


Decidable 


Sp = Sg 


Undecidable 


Decidable 


Decidable 


Decidable 


F noisy 


Undecidable 


Decidable 


Decidable 


Decidable 


c2 _ c2 
Op - Dg 


Undecidable 


Undecidable 


Decidable 


Decidable 


F^ noisy 


Undecidable 


Undecidable 


Decidable 


Decidable 


PPT 


Undecidable 


7 


Undecidable 


Decidable 



10. Open Problems 

Intrinsic simulation has been proven to be a powerful tool to hierarchize behaviors in the deterministic 

world. In particular, the notion of universal CA allows to formalize the concept of "most complex" CA 

as the ones concentrating "all the possible behaviors" within a given class ||5l|6l. The formalism and the 

notion of intrinsic simulation developed here for stochastic CA, enable us to export this classification 

tool to the stochastic world. In particular, it would be interesting to see whether our partial universality 

construction relates to experimentally observed classes, as in [29|. 

It would also be interesting to extend these notions of intrinsic simulation between stochastic C A to noisy 

Quantum Cellular Automata, as this could be of use for quantum simulation. 

At the theoretical level, and amongst all the concrete questions raised by this article, the following ones 

are of particular interest: 



• Is there for any stochastic A, a CFCA B which is =^f -equivalent to ^? 

• Is there for any stochastic A, a CFCA B such that A^^B? 

• Are there =^^-universal cellular automata? 

• Are universal CA the same for pre-order ^f and =^^? 

• Is PPT undecidable for CFCA in dimension 2 and higher? 

Our setting can also be generalized by taking any Bemouilli measure on the i?-component (instead 
of the uniform measure). We believe that positive and negative results about universality essentially still 
hold but under a different form. 

More generally, as far as we know, there is no characterization of the probability distributions over 
the configurations that correspond to images of cellular automata: deterministic automata starting from a 
random initial configuration, nor stochastic cellular automata starting from fixed or random distribution. 
In particular, we failed in our attempts to obtain an "Hedlund-like" characterization |[T6l of the stochastic 
maps corresponding to stochastic cellular automata (recall that there are constant stochastic maps which 
do not correspond to any stochastic cellular automaton). One possible direction might be to explore 
extensions of our framework allowing arbitrary shift-invariant distributions for the i2-configuration; a 
characterization of this extension would however be still unsatisfying since many shift-invariant distri- 
butions are highly non-local and are thus only remotely related to cellular automata. 
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